Document Summarization with Latent Queries

نویسندگان

چکیده

Abstract The availability of large-scale datasets has driven the development neural models that create generic summaries for single or multiple documents. For query-focused summarization (QFS), labeled training data in form queries, documents, and is not readily available. We provide a unified modeling framework any kind summarization, under assumption all are response to query, which observed case QFS latent summarization. model queries as discrete variables over document tokens, learn representations compatible with unobserved query verbalizations. Our formulates generative process, jointly optimizes conditional language model. Despite learning from only, our approach outperforms strong comparison systems across benchmarks, types, settings, target domains.1

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Topic-based Multi-Document Summarization with Probabilistic Latent Semantic Analysis

We consider the problem of query-focused multidocument summarization, where a summary containing the information most relevant to a user’s information need is produced from a set of topic-related documents. We propose a new method based on probabilistic latent semantic analysis, which allows us to represent sentences and queries as probability distributions over latent topics. Our approach comb...

متن کامل

Latent semantic sentence clustering for multi-document summarization

This thesis investigates the applicability of Latent Semantic Analysis (LSA) to sentence clustering for Multi-Document Summarization (MDS). In contrast to more shallow approaches like measuring similarity of sentences by word overlap in a traditional vector space model, LSA takes word usage patterns into account. So far LSA has been successfully applied to different Information Retrieval (IR) t...

متن کامل

Single Document Summarization with Document Expansion

Existing methods for single document summarization usually make use of only the information contained in the specified document. This paper proposes the technique of document expansion to provide more knowledge to help single document summarization. A specified document is expanded to a small document set by adding a few neighbor documents close to the document, and then the graphranking based ...

متن کامل

Hierarchical Summarization: Scaling Up Multi-Document Summarization

Multi-document summarization (MDS) systems have been designed for short, unstructured summaries of 10-15 documents, and are inadequate for larger document collections. We propose a new approach to scaling up summarization called hierarchical summarization, and present the first implemented system, SUMMA. SUMMA produces a hierarchy of relatively short summaries, in which the top level provides a...

متن کامل

Multi-Document Summarization with Subjectivity Analysis

In this paper, we present our team TUT/NII results at DUC 2005 and additional experiments on improving multi-document summarization. Summarization systems have typically focused on the factual aspect of information needs. Subjectivity analysis is another essential aspect for better understanding of information needs. Our approach is based on sentence extraction, weighted by sentence type annota...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Transactions of the Association for Computational Linguistics

سال: 2022

ISSN: ['2307-387X']

DOI: https://doi.org/10.1162/tacl_a_00480